cd/entity/DeepSeek v4Β· homeβ€Ί entitiesβ€Ί DeepSeek v4
grep -l @deepseek v4 /news/*.json | wc -l β†’ 1

@DeepSeek v4

mentions 1 type Person feed RSS
10:16
2026-04-27
ianbarber.blog
large-language-models

Loss Exploded.

Meta's FAIR team documented a series of training failures in 2021 for their OPT-175B model, including repeated loss explosions and learning issues that required extensive hyperparameter tuning and arc…

// co-occurs with top 5 entities